137 research outputs found

    Variant Surface Antigens Of Malaria Parasites: Functional And Evolutionary Insights From Comparative Gene Family Classification And Analysis

    Get PDF
    Background Plasmodium parasites, the causative agents of malaria, express many variant antigens on cell surfaces. Variant surface antigens (VSAs) are typically organized into large subtelomeric gene families that play critical roles in virulence and immune evasion. Many important aspects of VSA function and evolution remain obscure, impeding our understanding of virulence mechanisms and vaccine development. To gain further insights into VSA function and evolution, we comparatively classified and examined known VSA gene families across seven Plasmodium species. Results We identified a set of ultra-conserved orthologs within the largest Plasmodium gene family pir, which should be considered as high-priority targets for experimental functional characterization and vaccine development. Furthermore, we predict a lipid-binding domain in erythrocyte surface-expressed PYST-A proteins, suggesting a role of this second largest rodent parasite gene family in host cholesterol salvage. Additionally, it was found that PfMC-2TM proteins carry a novel and putative functional domain named MC-TYR, which is conserved in other P. falciparum gene families and rodent parasites. Finally, we present new conclusive evidence that the major Plasmodium VSAs PfEMP1, SICAvar, and SURFIN are evolutionarily linked through a modular and structurally conserved intracellular domain. Conclusion Our comparative analysis of Plasmodium VSA gene families revealed important functional and evolutionary insights, which can now serve as starting points for further experimental studies

    Genome Comparison of Human and Non-Human Malaria Parasites Reveals Species Subset-Specific Genes Potentially Linked to Human Disease

    Get PDF
    Genes underlying important phenotypic differences between Plasmodium species, the causative agents of malaria, are frequently found in only a subset of species and cluster at dynamically evolving subtelomeric regions of chromosomes. We hypothesized that chromosome-internal regions of Plasmodium genomes harbour additional species subset-specific genes that underlie differences in human pathogenicity, human-to-human transmissibility, and human virulence. We combined sequence similarity searches with synteny block analyses to identify species subset-specific genes in chromosome-internal regions of six published Plasmodium genomes, including Plasmodium falciparum, Plasmodium vivax, Plasmodium knowlesi, Plasmodium yoelii, Plasmodium berghei, and Plasmodium chabaudi. To improve comparative analysis, we first revised incorrectly annotated gene models using homology-based gene finders and examined putative subset-specific genes within syntenic contexts. Confirmed subset-specific genes were then analyzed for their role in biological pathways and examined for molecular functions using publicly available databases. We identified 16 genes that are well conserved in the three primate parasites but not found in rodent parasites, including three key enzymes of the thiamine (vitamin B1) biosynthesis pathway. Thirteen genes were found to be present in both human parasites but absent in the monkey parasite P. knowlesi, including genes specifically upregulated in sporozoites or gametocytes that could be linked to parasite transmission success between humans. Furthermore, we propose 15 chromosome-internal P. falciparum-specific genes as new candidate genes underlying increased human virulence and detected a currently uncharacterized cluster of P. vivax-specific genes on chromosome 6 likely involved in erythrocyte invasion. In conclusion, Plasmodium species harbour many chromosome-internal differences in the form of protein-coding genes, some of which are potentially linked to human disease and thus promising leads for future laboratory research

    Mos1-Mediated Transgenesis to Probe Consequences of Single Gene Mutations in Variation-Rich Isolates of Caenorhabditis elegans

    Get PDF
    Caenorhabditis elegans, especially the N2 isolate, is an invaluable biological model system. Numerous additional natural C. elegans isolates have been shown to have unexpected genotypic and phenotypic variations which has encouraged researchers to use next generation sequencing methodology to develop a more complete picture of genotypic variations among the isolates. To understand the phenotypic effects of a genomic variation (GV) on a single gene, in a variation-rich genetic background, one should analyze that particular GV in a well understood genetic background. In C. elegans, the analysis is usually done in N2, which requires extensive crossing to bring in the GV. This can be a very time consuming procedure thus it is important to establish a fast and efficient approach to test the effect of GVs from different isolates in N2. Here we use a Mos1-mediated single-copy insertion (MosSCI) method for phenotypic assessments of GVs from the variation-rich Hawaiian strain CB4856 in N2. Specifically, we investigate effects of variations identified in the CB4856 strain on tac-1 which is an essential gene that is necessary for mitotic spindle elongation and pronuclear migration. We show the usefulness of the MosSCI method by using EU1004 tac-1(or402) as a control. or402 is a temperature sensitive lethal allele within a well-conserved TACC domain (transforming acidic coiled-coil) that results in a leucine to phenylalanine change at amino acid 229. CB4856 contains a variation that affects the second exon of tac-1 causing a cysteine to tryptophan change at amino acid 94 also within the TACC domain. Using the MosSCI method, we analyze tac-1 from CB4856 in the N2 background and demonstrate that the C94W change, albeit significant, does not cause any obvious decrease in viability. This MosSCI method has proven to be a rapid and efficient way to analyze GVs

    Large synteny blocks revealed between Caenorhabditis elegans and Caenorhabditis briggsae genomes using OrthoCluster

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Accurate identification of synteny blocks is an important step in comparative genomics towards the understanding of genome architecture and expression. Most computer programs developed in the last decade for identifying synteny blocks have limitations. To address these limitations, we recently developed a robust program called OrthoCluster, and an online database OrthoClusterDB. In this work, we have demonstrated the application of OrthoCluster in identifying synteny blocks between the genomes of <it>Caenorhabditis elegans </it>and <it>Caenorhabditis briggsae</it>, two closely related hermaphrodite nematodes.</p> <p>Results</p> <p>Initial identification and analysis of synteny blocks using OrthoCluster enabled us to systematically improve the genome annotation of <it>C. elegans </it>and <it>C. briggsae</it>, identifying 52 potential novel genes in <it>C. elegans</it>, 582 in <it>C. briggsae</it>, and 949 novel orthologous relationships between these two species. Using the improved annotation, we have detected 3,058 perfect synteny blocks that contain no mismatches between <it>C. elegans </it>and <it>C. briggsae</it>. Among these synteny blocks, the majority are mapped to homologous chromosomes, as previously reported. The largest perfect synteny block contains 42 genes, which spans 201.2 kb in Chromosome V of <it>C. elegans</it>. On average, perfect synteny blocks span 18.8 kb in length. When some mismatches (interruptions) are allowed, synteny blocks ("imperfect synteny blocks") that are much larger in size are identified. We have shown that the majority (80%) of the <it>C. elegans </it>and <it>C. briggsae </it>genomes are covered by imperfect synteny blocks. The largest imperfect synteny block spans 6.14 Mb in Chromosome X of <it>C. elegans </it>and there are 11 synteny blocks that are larger than 1 Mb in size. On average, imperfect synteny blocks span 63.6 kb in length, larger than previously reported.</p> <p>Conclusions</p> <p>We have demonstrated that OrthoCluster can be used to accurately identify synteny blocks and have found that synteny blocks between <it>C. elegans </it>and <it>C. briggsae </it>are almost three-folds larger than previously identified.</p

    Coovar: Co-Occurring Variant Analyzer

    Get PDF
    Background Evaluating the impact of genomic variations (GV) on protein-coding transcripts is an important step in identifying variants of functional significance. Currently available programs for variant annotation depend on external databases or annotate multiple variants affecting the same transcript independently, which limits program use to organisms available in these databases or results in potentially incorrect or incomplete annotations. Findings We have developed CooVar (Co-occurring Variant Analyzer), a database-independent program for assessing the impact of GVs on protein-coding transcripts. CooVar takes GVs, reference genome sequence, and protein-coding exons as input and provides annotated GVs and transcripts as output. Other than similar programs, CooVar considers the combined impact of all GVs affecting the same transcript, generating biologically more accurate annotations. CooVar is operated from the command-line and supports standard file formats VCF, GFF/GTF, and GVF, which makes it easy to integrate into existing computational pipelines. We have extensively tested CooVar on worm and human data sets and demonstrate that it generates correct annotations in only a short amount of time. Conclusions CooVar is an easy-to-use and lightweight variant annotation tool that considers the combined impact of GVs on protein-coding transcripts. CooVar is freely available at http://genome.sfu.ca/projects/coovar/ webcite

    GExplore: a web server for integrated queries of protein domains, gene expression and mutant phenotypes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The majority of the genes even in well-studied multi-cellular model organisms have not been functionally characterized yet. Mining the numerous genome wide data sets related to protein function to retrieve potential candidate genes for a particular biological process remains a challenge.</p> <p>Description</p> <p>GExplore has been developed to provide a user-friendly database interface for data mining at the gene expression/protein function level to help in hypothesis development and experiment design. It supports combinatorial searches for proteins with certain domains, tissue- or developmental stage-specific expression patterns, and mutant phenotypes. GExplore operates on a stand-alone database and has fast response times, which is essential for exploratory searches. The interface is not only user-friendly, but also modular so that it accommodates additional data sets in the future.</p> <p>Conclusion</p> <p>GExplore is an online database for quick mining of data related to gene and protein function, providing a multi-gene display of data sets related to the domain composition of proteins as well as expression and phenotype data. GExplore is publicly available at: <url>http://genome.sfu.ca/gexplore/</url></p

    Combinatorial RNA interference in Caenorhabditis elegans reveals that redundancy between gene duplicates can be maintained for more than 80 million years of evolution

    Get PDF
    BACKGROUND: Systematic analyses of loss-of-function phenotypes have been carried out for most genes in Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster. Although such studies vastly expand our knowledge of single gene function, they do not address redundancy in genetic networks. Developing tools for the systematic mapping of genetic interactions is thus a key step in exploring the relationship between genotype and phenotype. RESULTS: We established conditions for RNA interference (RNAi) in C. elegans to target multiple genes simultaneously in a high-throughput setting. Using this approach, we can detect the great majority of previously known synthetic genetic interactions. We used this assay to examine the redundancy of duplicated genes in the genome of C. elegans that correspond to single orthologs in S. cerevisiae or D. melanogaster and identified 16 pairs of duplicated genes that have redundant functions. Remarkably, 14 of these redundant gene pairs were duplicated before the divergence of C. elegans and C. briggsae 80-110 million years ago, suggesting that there has been selective pressure to maintain the overlap in function between some gene duplicates. CONCLUSION: We established a high throughput method for examining genetic interactions using combinatorial RNAi in C. elegans. Using this technique, we demonstrated that many duplicated genes can retain redundant functions for more than 80 million years of evolution. This provides strong support for evolutionary models that predict that genetic redundancy between duplicated genes can be actively maintained by natural selection and is not just a transient side effect of recent gene duplication events

    Convergent evolution of RFX transcription factors and ciliary genes predated the origin of metazoans

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Intraflagellar transport (IFT) genes, which are critical for the development and function of cilia and flagella in metazoans, are tightly regulated by the Regulatory Factor X (RFX) transcription factors (TFs). However, how and when their evolutionary relationship was established remains unknown.</p> <p>Results</p> <p>We have identified evidence suggesting that RFX TFs and IFT genes evolved independently and their evolution converged before the first appearance of metazoans. Both ciliary genes and RFX TFs exist in all metazoans as well as some unicellular eukaryotes. However, while RFX TFs and IFT genes are found simultaneously in all sequenced metazoan genomes, RFX TFs do not co-exist with IFT genes in most pre-metazoans and thus do not regulate them in these organisms. For example, neither the budding yeast nor the fission yeast possesses cilia although both have well-defined RFX TFs. Conversely, most unicellular eukaryotes, including the green alga <it>Chlamydomonas reinhardtii</it>, have typical cilia and well conserved IFT genes but lack RFX TFs. Outside of metazoans, RFX TFs and IFT genes co-exist only in choanoflagellates including <it>M. brevicollis</it>, and only one fungus <it>Allomyces macrogynus </it>of the 51 sequenced fungus genomes. <it>M. brevicollis </it>has two putative RFX genes and a full complement of ciliary genes.</p> <p>Conclusions</p> <p>The evolution of RFX TFs and IFT genes were independent in pre-metazoans. We propose that their convergence in evolution, or the acquired transcriptional regulation of IFT genes by RFX TFs, played a pivotal role in the establishment of metazoan.</p

    Identification and characterization of novel human tissue-specific RFX transcription factors

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Five regulatory factor X (RFX) transcription factors (TFs)–RFX1-5–have been previously characterized in the human genome, which have been demonstrated to be critical for development and are associated with an expanding list of serious human disease conditions including major histocompatibility (MHC) class II deficiency and ciliaophathies.</p> <p>Results</p> <p>In this study, we have identified two additional RFX genes–RFX6 and RFX7–in the current human genome sequences. Both RFX6 and RFX7 are demonstrated to be winged-helix TFs and have well conserved RFX DNA binding domains (DBDs), which are also found in winged-helix TFs RFX1-5. Phylogenetic analysis suggests that the RFX family in the human genome has undergone at least three gene duplications in evolution and the seven human RFX genes can be clearly categorized into three subgroups: (1) RFX1-3, (2) RFX4 and RFX6, and (3) RFX5 and RFX7. Our functional genomics analysis suggests that RFX6 and RFX7 have distinct expression profiles. RFX6 is expressed almost exclusively in the pancreatic islets, while RFX7 has high ubiquitous expression in nearly all tissues examined, particularly in various brain tissues.</p> <p>Conclusion</p> <p>The identification and further characterization of these two novel RFX genes hold promise for gaining critical insight into development and many disease conditions in mammals, potentially leading to identification of disease genes and biomarkers.</p

    Gene Structure Evolution of the Na+-Ca2+ Exchanger (NCX) Family

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Na<sup>+</sup>-Ca<sup>2+ </sup>exchanger (NCX) is an important regulator of cytosolic Ca<sup>2+ </sup>levels. Many of its structural features are highly conserved across a wide range of species. Invertebrates have a single <it>NCX </it>gene, whereas vertebrate species have multiple <it>NCX </it>genes as a result of at least two duplication events. To examine the molecular evolution of <it>NCX </it>genes and understand the role of duplicated genes in the evolution of the vertebrate <it>NCX </it>gene family, we carried out phylogenetic analyses of <it>NCX </it>genes and compared <it>NCX </it>gene structures from sequenced genomes and individual clones.</p> <p>Results</p> <p>A single <it>NCX </it>in invertebrates and the protochordate <it>Ciona</it>, and the presence of at least four <it>NCX </it>genes in the genomes of teleosts, an amphibian, and a reptile suggest that a four member gene family arose in a basal vertebrate. Extensive examination of mammalian and avian genomes and synteny analysis argue that <it>NCX4 </it>may be lost in these lineages. Duplicates for <it>NCX1</it>, <it>NCX2</it>, and <it>NCX4 </it>were found in all sequenced teleost genomes. The presence of seven genes encoding <it>NCX </it>homologs may provide teleosts with the functional specialization analogous to the alternate splicing strategy seen with the three <it>NCX </it>mammalian homologs.</p> <p>Conclusion</p> <p>We have demonstrated that <it>NCX4 </it>is present in teleost, amphibian and reptilian species but has been secondarily and independently lost in mammals and birds. Comparative studies on conserved vertebrate homologs have provided a possible evolutionary route taken by gene duplicates subfunctionalization by minimizing homolog number.</p
    • …
    corecore